Spatial Information Guided Convolution for Real-Time RGBD Semantic Segmentation

نویسندگان

چکیده

3D spatial information is known to be beneficial the semantic segmentation task. Most existing methods take data as an additional input, leading a two-stream network that processes RGB and separately. This solution greatly increases inference time severely limits its scope for real-time applications. To solve this problem, we propose Spatial guided Convolution (S-Conv), which allows efficient feature integration. S-Conv competent infer sampling offset of convolution kernel by information, helping convolutional layer adjust receptive field adapt geometric transformations. also incorporates into learning process generating spatially adaptive weights. The capability perceiving geometry largely enhanced without much affecting amount parameters computational cost. We further embed network, called Guided Network (SGNet), resulting in state-of-the-art performance on NYUDv2 SUNRGBD datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Understanding Convolution for Semantic Segmentation

Recent advances in deep learning, especially deep convolutional neural networks (CNNs), have led to significant improvement over previous semantic segmentation systems. Here we show how to improve pixel-wise semantic segmentation by manipulating convolution-related operations that are of both theoretical and practical value. First, we design dense upsampling convolution (DUC) to generate pixel-...

متن کامل

Real-time Semantic Image Segmentation via Spatial Sparsity

We propose an approach to semantic (image) segmentation that reduces the computational costs by a factor of 25 with limited impact on the quality of results. Semantic segmentation has a number of practical applications, and for most such applications the computational costs are critical. The method follows a typical two-column network structure, where one column accepts an input image, while th...

متن کامل

Real-Time Semantic Clothing Segmentation

Clothing segmentation is a challenging field of research which is rapidly gaining attention. This paper presents a system for semantic segmentation of primarily monochromatic clothing and printed/stitched textures in single images or live video. This is especially appealing to emerging augmented reality applications such as retexturing sports players’ shirts with localized adverts or statistics...

متن کامل

Toward Real-time Indoor Semantic Segmentation Using Depth Information

This work addresses multi-class segmentation of indoor scenes with RGB-D inputs. While this area of research has gained much attention recently, most works still rely on handcrafted features. In contrast, we apply a multiscale convolutional network to learn features directly from the images and the depth information. Using a frame by frame labeling, we obtain nearly state-of-the-art performance...

متن کامل

Rethinking Atrous Convolution for Semantic Image Segmentation

In this work, we revisit atrous convolution, a powerful tool to explicitly adjust filter’s field-of-view as well as control the resolution of feature responses computed by Deep Convolutional Neural Networks, in the application of semantic image segmentation. To handle the problem of segmenting objects at multiple scales, we design modules which employ atrous convolution in cascade or in paralle...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE transactions on image processing

سال: 2021

ISSN: ['1057-7149', '1941-0042']

DOI: https://doi.org/10.1109/tip.2021.3049332